
    Latent Dirichlet Allocation (LDA) for improving the topic modeling of the Official Bulletin of the Spanish State (BOE)

    Since the Internet was born, most people have had free access to a great many sources of information. Every day many web pages are created and new content is uploaded and shared. Never in history have humans been more informed, yet also more uninformed, owing to the huge amount of information that can be accessed. When we look for something in any search engine, the results are too many to read and filter one by one. Recommender Systems (RS) were created to help us discriminate and filter this information according to our preferences. This contribution analyses the RS of the official agency of publications in Spain (BOE), which is known as "Mi BOE". The way this RS works was analysed, and all the metadata of the published documents were examined in order to determine the coverage of the system. The results of our analysis show that more than 89% of the documents cannot be recommended because they are not well described at the documentary level: some of their key metadata are empty. This contribution therefore proposes a method to label documents automatically based on Latent Dirichlet Allocation (LDA). The results show that, using this approach, the system could in theory recommend more than twice as many documents as it does now: 23% after applying the approach versus 11% before.

    A cloud-based tool for sentiment analysis in reviews about restaurants on TripAdvisor

    The tourism industry has been promoting its products and services based on the reviews that people often write on travel websites such as TripAdvisor.com, Booking.com and similar platforms. These reviews have a profound effect on the decision-making process when evaluating which places to visit, which restaurants to book, and so on. This contribution presents a cloud-based software tool for the massive analysis of this social media data (TripAdvisor.com). The main characteristics of the tool are: i) the ability to aggregate data obtained from social media; ii) the possibility of carrying out combined analyses of both people and comments; iii) the ability to detect the polarity (positive, negative or neutral) of the comments, quantifying the degree to which they are positive or negative, as well as predicting behaviour patterns from this information; and iv) the ease of doing everything in the same application (data downloading, pre-processing, analysis and visualisation). As a test and validation case, more than 33,500 reviews written in English about restaurants in the Province of Granada (Spain) were analysed.

    Selecting the W Matrix. Parametric vs Nonparametric Approaches

    In spatial econometrics, it is customary to specify a weighting matrix, the so-called W matrix, by simply choosing one matrix from the different types a user is considering (Anselin, 2002). In general, this selection is made a priori, depending on the user’s judgment. This decision is extremely important because, if matrix W is misspecified in some way, parameter estimates are likely to be biased and will be inconsistent in models that contain a spatial lag. Likewise, for models without spatial lags but with spatially autocorrelated random terms, robust estimates of the standard errors will be incorrect if W is misspecified. Goodness-of-fit tests may be used to choose between alternative specifications of W. In practice, however, most users impose a certain W matrix without testing the restrictions that the selected spatial operator implies. In this paper, we aim to establish a nonparametric procedure in which W is chosen by objective criteria. Our proposal is directly related to Information Theory. Specifically, the selection criterion that we propose is based on objective information existing in the data, which does not depend on the investigator’s subjectivity: it is a measure of conditional entropy. We compare the performance of our criterion against some alternatives, such as the J test of Davidson and MacKinnon or a likelihood ratio obtained in a maximum likelihood framework.
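
    The conditional entropy named above as the selection criterion can be illustrated with a minimal sketch. It estimates H(Y | X) from paired discrete observations; the function name, the discrete encoding of the data and the choice of bits as the unit are assumptions made for illustration, and this is not the authors' selection procedure, which the abstract does not detail.

```python
import math
from collections import Counter


def conditional_entropy(xs, ys):
    """Estimate H(Y | X) in bits from paired discrete observations.

    Hypothetical helper for illustration only; not taken from the paper.
    """
    if len(xs) != len(ys):
        raise ValueError("xs and ys must have the same length")
    n = len(xs)
    joint = Counter(zip(xs, ys))  # counts of each (x, y) pair
    marginal = Counter(xs)        # counts of each x value
    h = 0.0
    for (x, _y), count in joint.items():
        p_xy = count / n                   # empirical p(x, y)
        p_y_given_x = count / marginal[x]  # empirical p(y | x)
        h -= p_xy * math.log2(p_y_given_x)
    return h
```

    A lower value means that X leaves less uncertainty about Y. One could, for example, compare candidate conditioning variables by the value they yield, under the assumption that the data have been discretised beforehand.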

    Herrera Manuel - Callison College One Pager


    The Impact of Music Sharing on Album Purchases and Concert Attendance : the Case of Spain

    In the last decade, file sharing over the Internet has gone from being a phenomenon confined to a small share of the population to one with great social, media, economic and political impact. The reason is none other than the perception of a direct causal link between increasing downloads and decreasing purchases of cultural goods such as music albums, which we call the Replacing Utilitarian Perspective. In contrast, other studies argue that sharing on P2P networks does not replace the purchase of music albums, which we call the Complementary Consumption Perspective. This paper aims to examine these relationships empirically (download and purchase, and download and concert attendance) in individuals’ behavior, using the Cultural Habits and Practices Survey (SGAE, Spanish Ministry of Culture, 2007) and applying two multivariate techniques: least squares linear regression for the number of albums purchased and logistic regression for concert attendance. Our results reveal that the more albums individuals share, the more albums they purchase in physical format and the more likely they are to attend concerts. In addition, other variables influence both behaviors, such as educational level, age, socio-professional situation and the life-cycle stage associated with paternity or maternity.

    On copies and copyists (I): the formation of Magliabechiano manuscript VII, 353 of the Biblioteca Nazionale Centrale in Florence

    Codex VII, 353 is, together with VII, 354, one of the principal poetic manuscripts that the Florentine nobleman Girolamo da Sommaia compiled during his stay in Salamanca from 1598 or 1599 until 1607. About his life as a university student in the years 1603 to 1607, Sommaia wrote two Diaries, which are the Magliabechiano codices VIII, 29 (1603-1605) and VIII, 30 (1605-1608), published by Haley (1977). Although a foreigner, he was a great enthusiast of Castilian poetry in all its forms and knew the works of the most celebrated writers, both living (Góngora, Lope de Vega, Quevedo, Cervantes...) and dead (Fr. Luis de León, Francisco de Aldana, Diego Hurtado de Mendoza...). Proof of that enthusiasm is the miscellaneous poetry songbook VII, 353, whose poetic part was compiled between approximately June 1604 and July 1606.

    Representations of society: from modernity to postmodernity

    The transition from "modern" representations of society to the so-called "postmodern" ones can be described as an ambivalent effect of growing social differentiation. Beyond the obsolescence of certain traditional sociological paradigms, this differentiation on the one hand opens the debate about the possible representation of society, yet on the other it excludes the fundamental "relational" dimension. A good example of this process is Luhmann's "systemic" functionalism. In this paradigm the social is nothing more than "communication". Nevertheless, there remain spaces that may make it possible to recover the concrete character of social relations and, especially, to safeguard their "human" meaning.
